3.5 |
Class: Tryptophan cluster factors (WC), Alignment |
Note: The three families of this class do not share significant sequence similarities. Therefore, the sequence aligments of their DNA-binding domains will be listed separately for the Myb/SANT family (3.5.1), the Ets family (3.5.2), and the IRF family (3.5.3). |
Aligned Myb/SANT sequences (Family 3.5.1): |
Note that many factors of this family have two or three repeats of myb type, which have been separately aligned and consecutively numbered. |
GKTRWTREEDEKLKKLVEQNG----------TDDWKVIANYLPNRTDV------------QCQHRWQ-KVLNPEL MYB(1)
IKGPWTKEEDQRVIELVQKYG----------PKRWSVIAKHLKGRIGK------------QCRERWH-NHLNPEV MYB(2)
KKTSWTEEEDRIIYQAHKRLG-----------NRWAEIAKLLPGRTDN------------AIKNHWN-STMRRKV MYB(3)
CKVKWTHEEDEQLRALVRQFG----------QQDWKFLASHFPNRTDQ------------QCQYRWL-RVLNPDL MYBB(1)
VKGPWTKEEDQKVIELVKKYG----------TKQWTLIAKHLKGRLGK------------QCRERWH-NHLNPEV MYBB(2)
KKSCWTEEEDRIICEAHKVLG-----------NRWAEIAKMLPGRTDN------------AVKNHWN-STIKRKV MYBB(3)
LKKLWNRVKWTRDEDDKLKKLVEQH-----GTDDWTLIASHLQNRSDF------------QCQHRWQ-KVLNPEL AMYB(1)
IKGPWTKEEDQRVIELVQKYG----------PKRWSLIAKHLKGRIGK------------QCRERWH-NHLNPEV AMYB(2)
KKSSWTEEEDRIIYEAHKRLG-----------NRWAEIAKLLPGRTDN------------SIKNHWN-STMRRKV AMYB(3)
KGGVWRNTEDEILKAAVMKYG----------KNQWSRIASLLHRKSAK------------QCKARWY-EWLDPSI CDC5L(1)
KKTEWSREEEEKLLHLAKLMP-----------TQWRTIAPII-GRTAA------------QCLEHYE-FLLDKAA CDC5L(2)
NKQEWSREEEERLQAIAAAHG----------HLEWQKIAEELGTSRSA------------FQCLQKF-QQHNKAL SNAPC4(1)
KKGYWAPEEDAKLLQAVAKYG----------EQDWFKIREEVPGRSDA------------QCRDRYL-RRLHFSL SNAPC4(2)
KKGRWNLKEEEQLIELIEKYG----------VGHWAKIASELPHRSGS------------QCLSKWK-IMMGKKQ SNAPC4(3)
FMNVWTDHEKEIFKDKFIQHP-----------KNFGLIASYLERKSVP------------DCVLYYY-LTKKNEN NCoR1(1)
ETSRWTEEEMEVAKKGLVEHG-----------RNWAAIAKMVGTKSEA------------QCKNFYF-NYKRRHN NCoR1()
VMNMWSEQEKETFREKFMQHP-----------KNFGLIASFLERKTVA------------ECVLYYY-LTKKNEN NCoR2(1)
ESSRWTEEEMETAKKGLLEHG-----------RNWSAIARMVGSKTVS------------QCKNFYF-NYKKRQN NCoR2(2)
FPDEWTVEDKVLFEQAFSFHG-----------KTFHRIQQMLPDKSIA------------SLVKFYY-SWKKTRT RCoR1(1)
CNARWTTEEQLLAVQAIRKYG-----------RDFQAISDVIGNKSVV------------QVKNFFV-NYRRRFN RCoR1(2)
FPDEWTVEDKVLFEQAFGFHG-----------KCFQRIQQMLPDKLIP------------SLVKYYY-SWKKTRS RCoR2(1)
FNSRWTTDEQLLAVQAIRRYG-----------KDFGAIAEVIGNKTLT------------QVKTFFV-SYRRRFN RCoR2(2)
INARWTTEEQLLAVQGVRKYG-----------KDFQAIADVIGNKTVG------------QVKNFFV-NYRRRFN RCoR3(1)
FPDEWTVEDKVLFEQAFSFHG-----------KSFHRIQQMLPDKTIA------------SLVKYYY-SWKKTRS RCoR3(2)
ELSVWTEEECRNFEQGLKAYG-----------KDFHLIQANKVRTRSVG-----------ECVAFYY-MWKKSER MIER1
GLCAWSEEECRNFEHGFRVHG-----------KNFHLIQANKVRTRSVG-----------ECVEYYY-LWKKSER MIER2
GMTAWTEEECRSFEHALMLFG-----------KDFHLIQKNKVRTRTVA-----------ECVAFYY-MWKKSER MIER3
EMEEWSASEANLFEEALEKYG-----------KDFTDIQQDFLPWKSLT-----------SIIEYYY-MWKTTDR MTA1
EMEEWSASEAMLFEEALEKYG-----------KDFNDIRQDFLPWKSLA-----------SIVQFYY-MWKTTDR MTA2
EMEEWSASEASLFEEALEKYG-----------KDFNDIRQDFLPWKSLT-----------SIIEYYY-MWKTTDR MTA3
IEKCWTEDEVKRFVKGLRQYG-----------KNFFRIRKELLPNKETG-----------ELITFYY-YWKKTPE RERE
HDDAWTKAETDHLFDLSRRFD-----------LRFVVIHDRYDHQQFKK-----------RSVEDLKERYYHICA DMAP1
GSDKWTSLERKLFNKALATYS-----------KDFIFVQKMVKSKTVA------------QCVEYYY-TWKKIMR TRERF1
GSDVWTPIEKRLFKKAFYAHK-----------KDFYLIHKMIQTKTVA------------QCVEYYY-IWKKMIK ZNF541
QWESWSTEDKNTFFEGLYEHG-----------KDFEAIQNNIALKYKKKGKPASMVKNKEQVRHFYYRTWHKITK CRAMP1L
GSDQWKMAERKLFNKGIAIYK-----------KDFFLVQKLIQTKTVA------------QCVEFYY-TYKKQVK C14orf43
QAPEWTEEDLSQLTRSMVKFP------GGTPGRWEKIAHELG------------------RSVTDVT-TKAKQLK DNAJC1(1)
AEEPWTQNQQKLLELALQQYP------RGSSDRWDKIARCVPS-----------------KSKEDCIARYKLLVE DNAJC1(2)
GSKNWSEDDLQLLIKAVNLFP------AGTNSRWEVIANYMNI-----------------HSSSGVKRTAKDVIG DNAJC2(1)
DFTPWTTEEQKLLEQALKTYP------VNTPERWEKIAEAVPG-----------------RTKKDCMKRYKELVE DNAJC2(2)
GFTNWTKRDFNQFIKANEKYG----------RDDIDNIAREVEGKSPE------------EVMEYSAVFWERCNE SMARCA1(1)
KGKNYTEEEDRFLICMLHKMG-----------FDRENVYEELRQCVRNAP----------QFRFDWFIKSRTAME SMARCA1(2)
GFTNWNKRDFNQFIKANEKWG----------RDDIENIAREVEGKTPE------------EVIEYSAVFWERCNE SMARCA5(1)
KGKNYTEEEDRFLICMLHKLG-----------FDKENVYDELRQCIRNSP----------QFRFDWFLKSRTAME SMARCA5(2)
LDPSWTAQEEMALLEAVMDCG----------FGNWQDVANQMCTKTKE------------ECEKHYMKHFINNPL TADA2A
AEGGWTSREEQLLLDAIEQFG----------FGNWEDMAAHVGASRTPQ-----------EVMEHYVSMYIHGNL TADA2B
AGREWTEQETLLLLEALEMYK-----------DDWNKVSEHVGSRTQD------------ECILHFL-RLPIEDP SMARCC1
ATREWTEQETLLLLEALEMYK-----------DDWNKVSEHVGSRTQD------------ECILHFL-RLPIEDP SMARCC2
HVGKYTPEEIEKLKELRIKHG-----------NDWATIGAALGRSASSV-----------KDRCRLM-KDTCNT- DMTF(1)
--GKWTEEEEKRLAEVVHELTSTEPGDIVTQGVSWAAVAERVGTRSEK------------QCRSKWL-NYLNWKQ DMTF(2)
GGTEWTKEDEINLILRIAELDVADENDI-----NWDLLAEGWSSVRSPQ-----------WLRSKWW-TIKRQIA DMTF(3)
Aligned Ets sequences (Family 3.5.2): |
IQLwQFLLELLTDKSCQ-SFISwT-GDGwEFKLSD-PDE-VARRwGKRK-NKPKMNYEKLSRGLR ETS1
IQLWQFLLELLSDKSCQ-SFISWT-GDGWEFKLAD-PDE-VARRWGKRK-NKPKMNYEKLSRGLR ETS2
IQLwQFLLELLHDGARS-SCIRwT-GNSREFQLCD-PKE-VARLwGERK-RKPGMNYEKLSRGLR ETV2
IQLwQFLLELLTDKDAR-DCISwV-GDEGEFKLNQ-PEL-VAQKwGQRK-NKPTMNYEKLSRALR GABPA
IQLwQFLLELLSDSANA-SCITwE-GTNGEFKMTD-PDE-VARRwGERK-SKPNMNYDKLSRALR FLI1
IQLwQFLLELLSDSSNS-SCITwE-GTNGEFKMTD-PDE-VARRwGERK-SKPNMNYDKLSRALR ERG
IQLwQFLLELLADRANA-GCIAwE-GGHGEFKLTD-PDE-VARRwGERK-SKPNMNYDKLSRALR FEV
IQLwHFILELLQKEEFR-HVIAwQQGEYGEFVIKD-PDE-VARLwGRRK-CKPQMNYDKLSRALR ETV3
IQLwHFILELLQKEEFR-HVIAwQQGEYGEFVIKD-PDE-VARLwGRRK-CKPQMNYDKLSRALR ETV3L
IQLwHFILELLRKEEYQ-GVIAwQ-GDYGEFVIKD-PDE-VARLwGVRK-CKPQMNYDKLSRALR ERF
VTLwQFLLQLLREQGNG-HIISwTSRDGGEFKLVD-AEE-VARLwGLRK-NKTNMNYDKLSRALR ELK1
ITLwQFLLQLLLDQKHE-HLICwTSND-GEFKLLK-AEE-VAKLwGLRK-NKTNMNYDKLSRALR ELK3
ITLwQFLLQLLQKPQNK-HMICwTSND-GQFKLLQ-AEE-VARLwGIRK-NKPNMNYDKLSRALR ELK4
LQLwQFLVALLDDPSNS-HFIAwTGRG-MEFKLIE-PEE-VARRwGIQK-NRPAMNYDKLSRSLR ETV1
LQLwQFLVALLDDPTNA-HFIAwTGRG-MEFKLIE-PEE-VARLwGIQK-NRPAMNYDKLSRSLR ETV4
LQLwQFLVTLLDDPANA-HFIAwTGRG-MEFKLIE-PEE-VARRwGIQK-NRPAMNYDKLSRSLR ETV5
IYLwEFLLALLQDKATCPKYIKwTQREKGIFKLVD-SK-AVSRLwGKHK-NKPDMNYETMGRALR ELF1
TYLwEFLLDLLQDKNTCPRYIKwTQREKGIFKLVD-SK-AVSKLwGKHK-NKPDMNYETMGRALR ELF2
IYLwEFLLALLQDRNTCPKYIKwTQREKGIFKLVD-SK-AVSKLwGKQK-NKPDMNYETMGRALR ELF4
THLwEFIRDILLNPDKNPGLIKwEDRSEGVFRFLKS--EAVAQLwGKKK-NNSSMTYEKLSRAMR EHF
THLwEFIRDILIHPELNEGLMKwENRHEGVFKFLRS--EAVAQLwGQKKKN-SNMTYEKLSRAMR ELF3
SHLwEFVRDLLLSPEENCGILEwEDREQGIFRVVKS--EALAKMwGQRKKN-DRMTYEKLSRALR ELF5
IRLYQFLLDLLRSGDMK-DSIwwVDKDKGTFQFSSKHKEALAHRwGIQKGNRKKMTYQKMARALR SPI1
LRLyQFLLGLLTRGDMR-ECVwwVEPGAGVFQFSSKHKELLARRwGQQKGNRKRMTYQKLARALR SPIB
LRLFEYLHESLYNPEMA-SCIQWVDKTKGIFQFVSKNKEKLAELWGKRKGNRKTMTYQKMARALR SPIC
RLLwDYVYQLLSDSRYEN-FIRwEDKESKIFRIVD-PNG-LARLwGNHK-NRTNMTYEKMSRALR ETV6
RLLwDYVYQLLLDTRYEP-YIKwEDKDAKIFRVVD-PNG-LARLwGNHK-NRVNMTYEKMSRALR ETV7
IHLwQFLKELLLKPHSYGRFIRwLNKEKGIFKIEDS--AQVARLwGIRK-NRPAMNYDKLSRSIR SPDEF
Aligned IRF sequences (Family 3.5.3): |
RMRMRPWLEMQINSNQIPGLIWINKEEMIFQIPWKHAAKHGWDINKDACLFRSWAIHTGRYKAG---------EKEPDPKTWKANFRCAMNSLPDIEEVKDQSRNKGSSAVRVYRMLP IRF1
RMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKDAPLFRNWAIHTGKHQPG---------VDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRMLP IRF2
KPRILPWLVSQLDLGQLEGVAWVNKSRTRFRIPWKHGLRQDAQQE-DFGIFQAWAEATGAYVPG---------RDKPDLPTWKRNFRSALNRKEGLRLAEDRSKDPH-DPHKIYEFVN IRF3
NGKLRQWLIDQIDSGKYPGLVWENEEKSIFRIPWKHAGKQDYNREEDAALFKAWALFKGKFREG---------IDKPDPPTWKTRLRCALNKSNDFEELVERSQLDISDPYKVYRIVP IRF4
RVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGDNTIFKAWAKETGKYTEG---------VDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYEVCS IRF5
RVRLKPWLVAQVDSGLYPGLIWLHRDSKRFQIPWKHATRHSPQQEEENTIFKAWAVETGKYQEG---------VDDPDPAKWKAQLRCALNKSREFNLMYDGTKEVPMNPVKIYQVCD IRF6
RVLFGEWLLGEISSGCYEGLQWLDEARTCFRVPWKHFARKDLSEA-DARIFKAWAVARGRWPPSSRGGGPPPEAETAERAGWKTNFRCALRSTRRFVMLRDNSGD-PADPHKVYALSR IRF7
GRRLRQWLIEQIDSSMYPGLIWENEEKSMFRIPWKHAGKQDYNQEVDASIFKAWAVFKGKFKEG----------DKAEPATWKTRLRCALNKSPDFEEVTDRSQLDISEPYKVYRIVP IRF8
TRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFKAWAIFKGKYKEG----------DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLP IRF9